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Abstract 

The effective free energy of globular protein chain is considered to 
be a functional defined on smooth curves in three dimensional Eu- 
clidean space. From the requirement of geometrical invariance, to- 
gether with basic facts on conformation of helical proteins and dynam- 
ical characteristics of the protein chains, we are able to determine, in 
a unique way, the exact form of the free energy functional. Namely, 
the free energy density should be a linear function of the curvature 



*E-mail: feoli@unisannio.it; fax: +39089965275 
^E-mail: nestr@thsunl.jinr.ru 
* E-mail: scarpetta@sa.infn.it 



1 



of curves on which the free energy functional is defined. We briefly 
discuss the possibility of using the model proposed in Monte Carlo 
simulations of exhaustive searching the native stable state of the pro- 
tein chain. The relation of the model proposed to the rigid relativistic 
particles and strings is also considered. 

PACS numbers: 87.15.-v, 87.15.Cc, 05.10.Ln, 02.40.-k 
Keywords: protein folding, semi-flexible polymers, geometry of curves, 
helices, rigid string, particles with rigidity. 
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1 Introduction 



A fascinating and open question challenging physics, biochemistry and even 
geometry is the presence of highly regular motifs such as a — helices and 
(3 — sheets in the folded state of biopolymers and proteins. A wide range of 
approaches have been proposed to rationalize the existence of such secondary 
structures (see, for example, reviews PQ 12 E] and references therein) . 

In principle, in order to find the stable native state of a protein, one 
should compute, for every possible conformation of the chain, the sum of 
the free energies of the atomic interactions within the protein as well as 
with the solvent and then find the conformation with the lowest free energy 
However, it is not feasible, because the number of conformations of a protein 
chain grows exponentially with the chain length. 

In the present paper we are going to propose a pure geometrical approach 
to describe the effective free energy of proteins, proceeding from the most 
general invariance requirements and basic experimental facts concerning the 
protein conformation. It should be noted here that we shall consider not 
the Hamiltonian H (or energy) of an individual protein molecule, but the 
effective free energy of an assembly of noninteracting protein chains. We 
define this energy as such a functional F depending on the radius vector 
x(s) of a protein chain, the stationary values of which (the extrema) are 
provided only by the observed configurations of proteins. Obviously, the 
effective free energy defined in such a way is analogous to the effective action 
in quantum field theory |3]. 

Let i£[x(s)] be the energy of an individual protein chain then the effective 
free energy F we are interested in is apparently determined by the functional 
integral 



where (3 is inverse temperature and we have split the protein radius vec- 
tor x into the classical part x c ; and the quantum part x gu . In order to use 
this formula the energy functional E for an individual protein chain should 
be specified. The problem under consideration is close to the statistics of 
semi- flexible polymers. In the last case a number of models (i.e., particular 
functionals E) have been proposed which substantially rely on the differential 
geometry (Kratky-Porod model [Sj, Dirac model [HIE!, tube model jH] and 
so on). Differential geometry provides a direct relation between physics and 
mathematics and thereby opens a straight route for applying the mathemat- 
ical analysis to this physical problem If k(s) and k(s) are respectively 
the curvature and the torsion of a polymer chain then the energy E can be 
represented as a polynomial in these geometrical invariants, for example, in 
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the form [§| 



E 




(1.2) 



where A, C, a, and (3 are phenomenological parameters. 

However evaluation of the functional integral in Eq. (jl.lj) with the energy 
E given in Eq. (jl.2|) or by analogous formula is a nontrivial mathematical 
problem. The progress on this line is possible only when applying approxi- 
mate methods [Oj. As a rule, the results obtained here do not allow one to 
predict the exact stationary form of protein or polymer chains, at best one 
can calculate the statistical momenta of the chains. 

In this situation we propose another way of consideration. We shall try 
straight away to determine the effective free energy -F[x c ;] without specifying 
the energy functional E of an individual protein chain. For this purpose we 
shall use experimental knowledge concerning the protein chain configurations 
and the tools of differential geometry. 

Such a statement of the problem on describing the statistical properties 
of proteins semi-flexible polymers is non-standard one. In some sense it is an 
inverse with respect to a traditional formulation of the problem. Nevertheless 
we suppose that such an inverse task is undoubtedly of substantial interest. It 
turns out that when confining oneself to helical proteins the inverse problem 
at hand has a nice mathematical solution, being unique. It is the prove of 
this assertion that is the basic result of this paper. 

Taking into account the one-dimensional nature of the protein chains, 
the relevant macroscopic free energy F should be considered as a functional 
defined on smooth curves x(s) (or paths) in the three dimensional Euclidean 
space 



where s is the length of a protein molecule. For shortennig the formulas we 
drop the subscript cl in x. The reparametrization invariance of the functional 
F demands the free energy density T to be a scalar function depending on 
the geometrical invariants of the position vector x(s), which describes the 
spatial shape of the protein chain. In three dimensional ambient space a 
smooth curve has two local invariants: curvature k{s) and torsion k(s). In 
the general case of D dimensional Euclidean embedding space there are D — l 
principal curvatures k a (s),a = 1,2...., D — 1 of a curve, where fci(s) = k(s) 
and k2(s) = k(s). 

The basic theorem (due to O. Bonnet) in the theory of curves reads [TT] : 
given the curvatures k a (s),a = 1,2...., D — 1, a curve x(s) is determined in 
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.D-dimensional Euclidean space up to its rotations and displacements as a 
whole. The curvature k a (s) is expressed in terms of the derivatives, with 
respect to s, of the radius vector of a curve x(s) till the (a + l)-th order 
included. 

The first principal curvature, or simply the curvature, ki(s) = k(s) of 
a curve characterizes the local bending of the curve at the point s. Hence, 
the dependence of free energy density T on k(s) specifies the resistance of a 
protein chain to be bent. The second curvature or torsion k(s) is determined 
by the relative rotation, around the tangent dx.{s)/ds at the point s, of two 
neighbor infinitely short elements of the protein chain. It is well known pQ 
that, in the case of protein molecules, such a rotation is quite easy, as it 
requires little effort. In other words, this rotation results in small energy 
differences, allowing many overall conformations of a protein chain to arise. 
Thus the dependence of the free energy density T on torsion k(s) can be 
neglected at least as a first approximation. Some comments concerning this 
restriction will be given in Section 4. Finally one can consider the free energy 
density T to be a function only of the curvature k(s), i.e. T = J-(k(s)). In 
what follows we shall try to specify this dependence explicitly keeping in 
mind the description of globular protein conformation. 

A peculiarity of conformation of globular proteins is that they can be 
ordered assemblies either of helices or of sheets as well as a mixture of he- 
lices and sheets JT]. In the phenomenological macroscopic approach, which 
is developed here, the presence of sheets in the spatial structure of globu- 
lar proteins implies the necessity to introduce, in addition to space curves 
x(s), new dynamical variables y(s,s') describing surfaces in ambient space. 1 
Obviously such an extension of the problem setting would complicate consid- 
erably our consideration. Therefore we confine ourselves to helical proteins 
and try to answer the question: Is it possible to specify the function JT(A;(s)) 
in such a way that the extremals of the functional F = J Tds would be only 
helices? The answer to this question turns out to be positive and unique, 
namely, the density of the free energy J r (k(s)) should be a linear function of 
the curvature k(s): 

F(k) = a-\0\k(s), (1.4) 

where the constants a and f3 are the parameters specifying the phenomeno- 
logical model proposed, a > 0, (3 < 0. A rigorous proof of this assertion is 
the main result of the present paper. 

The layout of the paper is as follows. In Section 2 the Euler-Lagrange 

We do not consider here such exotic curves as Peano's curve which, being specified by 
one parameter s, nevertheless covers, completely tightly, a finite region of a surface (see, 
for example, Ref. [T2]1- 
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variational equations for the functional F are reformulated in terms of the 
principal curvatures. In Section 3 the explicit integrability of these equa- 
tions will be shown. In other words, for an arbitrary function !F(k(s)), the 
corresponding Euler-Lagrange equations are integrable by quadratures. The 
physical meaning of the relevant constants of integrations is also discussed. 
In Section 4 the requirement that the solutions of the Euler-Lagrange equa- 
tions describe only helices immediately confines the admissible free energy 
density JF(/c(s)) to be a linear function (jl.4j) . In conclusion (Section 5), the 
obtained results and their physical implications are briefly discussed. 

2 Euler-Lagrange equations in terms of prin- 
cipal curvatures 

Keeping in mind the description of the protein conformation, one should 
consider the problem in three dimensional Euclidean space. However, from 
the mathematical standpoint, it is convenient to formulate the equations we 
are looking for, first in D-dimensional Euclidean space and only after that 
to go to the three dimensional ambient space. 

We shall follow our papers ^Sl IT^j . where the analogous problem has 
been considered in Minkowski space-time and the results obtained have been 
applied to the models of relativistic particles described by the Lagrangians 
dependent on higher derivatives of the particle coordinates. Unfortunately, 
the transition to the Euclidean space in the final formulas of Ref. |TH| is not 
obvious. 2 Therefore we derive the equations for the principal curvatures and 
the relevant integrals of motion in Euclidean space anew. In this section we 
shall use a natural parametrization of the curve x(s) in the D-dimensional 
Euclidean space: x l (s), i = 1, 2, . . . , D. In this parametrization 



or in another way ds 2 = dx % dx % = (dxdx). As usual the sum over the 
repeated indexes is assumed in the corresponding range and, for shortening, 
the differentiation with respect to the natural parameter s will be denoted 
by a prime. 

The curve x l (s), i = 1, 2, . . . , D has D — 1 principal (or external) curva- 
tures which determine this curve up to its motion as a whole in embedding 

differential geometries of curves in Euclidean space and in Minkowski space-time differ 
substantially. In main part it is due to three types of curves existing in pseudo-Euclidean 
space: timelike, spacelike and isotropic (null) curves |15| . 




dx l dx l 
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space. The first curvature, or simply curvature, is given by 



*?(«) = = = (xY). (2.2) 

For an arbitrary functional F defined on curves x l (s) in D- dimensional 
space the Euler-Lagrange equations are a set of exactly D equations 

8F , . 
— = 0, i = l,2,. (2.3) 

However, if the functional F depends only on the curvature 

F{k(s))ds, (2.4) 



then D equations (2.3) for D variables x 1 (s), i = 1,2,..., D give D — 1 
equations for the principal curvatures k a (s), a = 1, 2, . . . , D — 1. 

When deriving the equations for k a (s) we shall use the orthonormal Frenet 
basis {e a }: 

(e a e b )=5 ab , a, b = 1,2 , D (2.5) 

associated with the curve x(s) and the Frenet equations governing the motion 
of this basis along the curve x(s) 

^ = u ab e b , u ab + u ba = 0. (2.6) 

as 

The unit vector of the Frenet basis e a (s) is expressed in terms of the deriva- 
tives of the position vector x(s) with respect to the parameter s till the order 
a included, a = 1, 2, D. The first vector ei is directed along the tangent 
of the curve at the point s 

= (2.7) 

In what follows, the explicit expressions, in terms of x(s), for the rest of 
the unit vectors e a (s) are not necessary (relevant formulas can be found, 
for example, in [TT)| I17j). Nonzero elements of the matrix uj in the Frenet 
equations (2.6) are determined by the principal curvatures 

Ua,a+l = -Ua+l,a = k a +l(s), a = 1, . . . , D - 1 . (2.8) 

In turn, the principal curvature k a (s) is expressed in terms of the deriva- 
tives of the position vector x(s) till the (a + l)-th order included. The first 
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curvature is defined in Eq. (2.2). Besides, we shall use the explicit expres- 
sion, through the derivatives of x(s), only for the second curvature or torsion 
k 2 (s) = k(s). 

The variation of the space form of the protein molecule 

5x(s) =e a (s)e a {s), a = l,...,D (2.9) 

results in the following variation of the free energy functional F defined in 
Eq. (2.4) 

5F = 5F 1 + 5F 2 , (2.10) 

where 

6F X = JdsF'ih^Shis), (2.11) 

SF 2 = J^kJSds. (2.12) 

In what follows the prime on the free energy density J-{k\) will denote the 
differentiation with respect to its argument k\. 

The variation 5 ds is calculated in a straightforward way 

Sds = sVd£ r dx l = ==^d(6x i ) = (x!d6x). (2.13) 

(JjO (JjO 

The substitution of Eq. (2.13) into 5F 2 and the subsequent integration by 
parts 3 give 

5F 2 = - Jf'ih) k[ (x'5x) ds - ffih) (x"5x) ds . (2.14) 

By making use of the Frenet equations (2.6) and the expansion (2.9) one can 
derive 

(x"5x) = (eitoc) = cu 12 (e 2 5x) = k x e 2 . (2.15) 
In view of this, Eq. (2.12) acquires the form 

SF 2 = - Jy^k'^^ds - Jj 7 (k 1 )k 1 (s)e 2 (s)ds. (2.16) 



3 We are interested in statistical applications of the ultimate equations, therefore the 
contributions depending on the position of the protein edges are dropped. 
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The calculation of the variation SFi, defined in Eq. (2.11), is more com- 
plicated. First we find the variation of the curvature Ski(s). Definition (2.2) 
and the Frenet equations (2.6) give 

fci(s) 5h(s) = (id'Sic") = (e'^x") = ki(e 2 Sic"). 

Hence 

5h = (e 2 5x"). (2.17) 
Applying Eq. (2.13) step by step one derives 

. , _rfx d dxhddSic) d . i d d . . d 

5x=5 fc = ~ds^ ds^~ = ds~ 5 *~ X (X Ts 6 ^ = d^ X " ei(ei ^ x) ' 

(2.18) 



. „ .did d . , did (id d Sic) d 2 . . . d . . d . d . . 

5X = S lb = te Sx ^ = ^ X " 2ei(ei ^ x)_ei ^ (ei ^ x) - 

(2.19) 

In view of (2.17), the last term in Eq. (2.19) does not contribute to Ski. Sub- 
stituting Eq. (2.19) into (2.17) and using the Frenet equations (2.6) together 
with expansion (2.9) we obtain 



Skt(s) = eik[ + e 2 + e 2 (k\ - k\ 



2e' 3 k 2 - e 3 k' 2 - e A k 2 k 3 . 



(2.20) 



As one could expect formula (2.20) for Ski differs from its analogue in 
Minkowski space (see Eq. (2.18) in Ref. [13] only by signs of some terms. 
However, without the direct calculation of Ski in Euclidean space the rule of 
sign changes in this expression is not obvious. 

Thus, the variation of the first curvature, Ski(s), depends on the varia- 
tions of the worldline coordinates only along the directions e 1; e 2 , e 3 , and 
e 4 (on the four arbitrary functions e a (s), a = 1, 2, 3, 4) and on the first three 
curvatures ki, k 2 , and k 3 . 

Substituting Eq. (2.20) into (2.11), integrating the latter equation by 
parts, and taking into account (2.16) we obtain 



SF= jdsi (kl-kl)F\ki) + ^{F'{ki))-kiF{ki) 



e 2 (s 



+ 



2^-{T\ki)k 2 )-k' 2 T'{ki) 
ds 



s 3 (s)-T'(ki)k 2 k 3 e 4 (s)^= 0.(2.21) 



The terms in SFi and SF 2 , depending on the variation £i{s) along the tan- 
gent to the curve, are mutually cancelled, and the variation SF depends, as 
one could expect, only on the normal variation of the curve x(s), or more 
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precisely, on the variation of x(s) along the three normals e 2 , e 3 , and e 4 . 
Apparently the last property of the variation 5F is due to a special type of 
functionals (2.4) under consideration, which depend only on the curvature of 
a curve. From Eq. (2.21) three equations for principal curvatures follow 

- (kj - k 2 2 ) F'(h) + h F{h x ) , (2.22) 

KF'ih), (2.23) 
0. (2.24) 

At first glance, Eqs. (2.22) - (2.24) are obviously insufficient in order to 
determine all D — 1 principal curvatures of the curve x(s) embedded in D- 
dimensional Euclidean space with arbitrary D. However, it is not the case. 

We start the analysis of the obtained equations from Eq. (2.24). The most 
general condition on the principal curvatures, following from this equation, 
is the requirement of vanishing the third curvature 

k 3 {s) = 0. (2.25) 

The point is that the principal curvatures, due to their construction, obey the 
conditions ^UGH]: if a curvature kj(s) does not vanish at the point s, then 
at this point the curvatures k a (s) with a = 1,2, j — 1 are also different 
from zero. But the vanishing of kj(s) at a point s, kj(s) = 0, implies that 
k a (s) — for a = j + l,j + 2, D — 1. In view of this, Eq. (2.25) entails 
the following conditions 

k 4 (s) = k 5 (s) = ... = = 0. (2.26) 

Thus, in the problem under consideration there are two non trivial equa- 
tions (2.22) and (2.23) for the curvatures ki(s) and /^(s)- Equation (2.23) 
can be integrated with arbitrary free energy density J-'(ki). Actually, for 
k 2 7^ one can rewrite this equation in the form 

k> 2 r'(h)k[ _ 

k 2 

or 

2d {\nF\h)] + d(lnk 2 ) = 0. 

Hence 

(F'(k 1 )) 2 k 2 = C, (2.27) 
where C is an integration constant. 



as 

F'iki) k 2 h = 
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Relation (|2.27|) enables one to eliminate the torsion k 2 (s) from Eq. (2.22). 
As a result we are left with one nonlinear differential equation of the second 
order for the curvature k\(s) 



ds 2 



c 2 



F'iki) - h\F{k\) =0. 



(2.28) 



Having resolved this equation for fci(s), one can determine the rest of 
curvatures by making use of Eqs. (2.27), (2.25), and (2.26). Integration of 
the Frenet equations (2.6) with principal curvatures found enables one to 
recover the curve x(s) itself. 

Notwithstanding its nonlinear character, Eq. 1)2.28)1 can be integrated in 
quadratures for arbitrary function J-(k\). To show this, the first integral for 
this equation will be constructed proceeding from the symmetry properties 
of the variational problem under study. 

3 Exact integrability of the Euler- Lagrange 
equations for principal curvatures 

The functional (2.4) possesses a quite large set of symmetries, the analysis 
of which will allow us to construct the first integral for nonlinear equation 
()2.28)) . In this section we shall use an arbitrary parametrization of the curve 
x(t). Now the functional (2.4) assumes the form 



where a dot over x denotes the differentiation with respect to the parameter 
r. In arbitrary parametrization the curvature k\ is given by 




(3.1) 




(3.2) 



Functional (3.1) is invariant under the following transformations: 



i) translations of the curve coordinates by a constant vector 



x 



x + a, 



a = const. 



(3.3) 



ii) 5'0(D)-rotations of the ambient space coordinates 



+ u) l3 = -u 3 \ i,j = 1, 2, D; 



(3.4) 
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iii) reparametrization 

r -> /(r) (3.5) 
with an arbitrary function /(r) subjected to the condition /(r) 7^ 0. 

According to the first Noether theorem, the invariance of the functional 
(3.1) under the translations (3.3) entails the conservation, under the motion 
along the curve x(s), of the 'momentum' vector 

i = l,2, ...,-D, ,3,) 

where Pq are constants. The Euler-Lagrange equations (2.3), rewritten in 
terms of P\ exhibit this manifestly 

^P* = 0, i = l,2,. ..,£>. (3.7) 

It is clear that the relations (3.6), due to their vector character, can not 
be used directly as integrals for Eqs. (2.22)-(2.24) determining the principal 
curvature k a . Simply, the vector P l can not be expressed only in terms of 
the invariants k a (s). However, one can hope that the invariant 

P 2 = P 2 = M 2 , (3.8) 

where M 2 is a new non negative constant, can be expressed through the 
principal curvatures k a (s) only. It turns out that this is really the case. 

To show this, we pass in Eqs. (3.6) from the differentiation with respect 
to arbitrary parameter r to the differentiation with respect to the natural 
parameter s (the length of a curve). For this purpose the formulas 

d rrr; d d 2 x. x 2 x — (xx)x , dk\ 1 d 2 x. 

== -^2 — — Js>_^ — — 

dr ds ' ds 2 (x 2 ) 2 9x x 2 ds 2 ' 

h ^ = j^y 4 { [3 (xx) 2 - 2 x 2 x 2 ] x - x 2 (xx) x} (3.9) 

are useful. As a result, Eq. (3.6) for the vector P, constant along the curve 
x(s), acquires the form 



ds 3 



(3.10) 
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The definitions of the parameter s (2.1) and the curvature k\ (2.2) entail the 
relations 

<ix<i 2 x\ /<ix<i 3 x\ , 9 /<i 2 x<i 3 x\ , d&i 

^ 0, — — U-A; 2 , — — )=k'. (3.11) 



(is <is 2 / \ds ds 3 J \ds 2 ds 3 J ds 

The vector x'" squared can be deduced from the definition of the second 
curvature k 2 or torsion (TTJ [TE] 

,4,9 , / dx <i 2 x <i 3 x\ 

^^ = deto U'^'^)' (3 ' 12) 

where detg (a, b, c) is the Gramm determinant for vectors a, b, and c \TQ . 
Thanks to Eq. (2.27), we obtain from (3.12) 

^ V ki+(^)\ktkt = k$+(^)\kt-^. (3.13) 



ds 3 J 1 \ds J \ds J (F'(h)) 

Squaring Eq. (3.10) and using Eqs. (3.11) and (3.13), we have 

M 2 = {F'ih) h - Hki)f + Jj^j-jy + {KT{F"{h)T . (3.14) 

In contrast to pseudo- Euclidean space [IB], the right-hand side of Eq. (3.14) 
is strictly nonnegative. 

By direct differentiation of Eq. (3.14) with respect to s one can be con- 
vinced that, for 

F"(h)^0, (3.15) 

the relation (3.14) is an integral of the nonlinear differential equation ()2.28|) . 
which determines the curvature of a stationary curve. From (3.14) we deduce 

^ = ±V / 7(^), (3-16) 



where 



2 



Integration of (3.16) gives 



(3.17) 



fc i0) fiu 

-= = ± («-*<>) (3.18) 
fcio vf( k ) 



with k w = fci(so) 



11 



Thus, if the free energy density T{ki) obeys the condition (3.15), then 
the curvature ki{s) and the torsion k 2 (s) of the stationary curve x(s) are 
the functions of the parameter s defined by Eqs. (3.18) and (2.27). The case 
when the condition (3.15) is not satisfied, i.e., when the free energy density 
J-'(ki) is a linear function of the curvature ki(s), will be considered in the 
next section. 

Closing this section we briefly consider the consequences of the invariance 
of the free energy under the transformations (3.4) and (3.5). According to the 
first Noether theorem, the rotation invariance of the functional (2.4) entails 
the conservation of the angular momentum tensor 

2 

M^ = ^(gX-gX)> Z<J ' (3 - 19) 

(T=l 

where the canonical variables q CT and p CT , o = 1, 2 are defined as follows [201 

qi = x, q 2 = x, 



Pl = P = d^~^ P2 = dx~ • (3 ' 20) 

As in the case of the translation invariance, the conserved angular mo- 
mentum tensor (3.19) does not provide by itself the integrals for the equations 
determining the principal curvatures, while the invariant S 2 constructed in 
terms of this tensor does 

W 

g2 = w (3 - 21 > 

where 

W = - M ij M ij P 2 - (M ij P j f, M 2 = P 2 . (3.22) 

In Minkowski space-time the analogous invariant is the spin of a dynamical 
system |2Ij. Direct calculation gives [T3] 

M 2 S 2 = k\ (T'f . (3.23) 

Thus the invariant S specifies the integration constant C in Eq. (2.27), namely 

C 2 = M 2 S 2 . (3.24) 

According to the second Noether theorem [22J, the invariance of the func- 
tional (3.1) under the reparametrization (3.5) results in the identity which 
is obeyed by the left-hand sides of the Euler-Lagrange equations (2.3). As 
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in the theory of relativistic strings |23], one can show that the projection of 
Eqs. (2.3) on the tangent vector x(r) is identically equal to zero 

5F ■ 

— x l = 0. (3.25) 

Thus between equations (2.3) there are only D — 1 independent ones. It is 
this fact that explains why we succeeded in obtaining the closed set of D — 1 
equation for the principal curvatures of a stationary curve. 

4 Determining the functional form of the free 
energy density 

We are going to fix the function !F(ki), requiring that all the solutions to 
the Euler - Lagrange equations are helices. From the differential geometry of 
curves fTJ E| it is known that the helices in three dimensional space have 
a constant curvature (ki) and a constant torsion (k 2 ) which determine the 
radius R and the step d of a helix 4 

R= p+~p ' d= pip- ( 4 - 1 ) 

As shown in the previous section, under the condition (3.15) the curvature 
k\ and the torsion ki of the stationary curve are not constant but they are 
the functions of the parameter s which are defined in Eqs. (3.18) and (2.27). 
Hence, for the free energy density J-'(ki) we are looking for, we have to replace 
Eq. (3.15) with the requirement 

T"{kx) = 0. (4.2) 

So T(ki) should be a linear function of the curvature k\{s) 

F(k 1 ) = a + f3k 1 (s), (4.3) 

where a and (3 are constants. Substituting Eq. (4.3) into (2.27) and (2.28) 
we obtain 

C 2 C 

^-^5' h = W (4 - 4) 

4 It is interesting to note that for an arbitrary dimension D of the ambient space, the 
curves with constant principal curvatures have a drastically different behavior in the large 
depending on whether the dimension D is even or odd. If D is even, then the curves under 
consideration are situated in a restricted part of the space. For odd D, such curves go to 
infinite in one direction. 
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Since fci(s) = |x"(s)| > 0, the constants a and (3 should have opposite signs. 
It is natural to put a > and j3 < 0. 

For the free energy density J-{k\), linear in curvature, the integral (3.14) 
gives just the relation between the integration constants M 2 and S 2 (or 
between M 2 and C 2 ) 



When a = 0, Eq. (2.28) implies that the curvature ki(s) is an arbitrary 
function of s and the integration constant C vanishes. In this case Eq. 
(2.27) yields = 0. Hence, for a = 0, the solutions to the Euler-Lagrange 
equations are arbitrary plane curves, which is evidently unacceptable for our 
purpose. 

Finally, requiring that the stationary curves for the functional (2.4) are 
only helices, we uniquely determine the free energy density, namely it should 
be a linear function of the curvature 



with nonzero constants a and /?, providing a > and (5 < 0. 

In order to get some insight into the physical meaning of the invariants 
M 2 and C 2 we calculate them for protein chains of different forms. For 
straight line chain we have 



Here it is worth noting the following. As known, in the protein physics 
the chirality [21] property of these molecules are of importance. At the first 
glance we have ignored this point because from the very beginning we have 
eliminated the dependence of the free energy density / on the curve torsion. 
But the fact is the integration constant C in our equations is responsible 
for the chirality property of the helical curves under consideration. Really, 
this constant is an analog of the Pauli-Lubanski pseudoscalar|25[ EE] (in 
the three dimensional space-time the Pauli-Lubanski vector reduces to the 
pseudoscalar) . The positive and negative values of this constant distinguish 
the left-hand and right-hand chirality of the helical curves. 

Recently, the functionals defined on the world trajectories in Minkovski 
spacetime and depending on the Cartan curvatures of those lines have been 



Ffa) = a - \P\his) 



(4.6) 



M 2 = a 2 , C = 0. 



For stationary chains (helices) these invariants are 
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considered to be a generalization of the relativistic model of a point particle 
with possible application to string theory [21] , boson-fermion transmutations 
|28j . anyon models random walks [30J and so on. It is worth noting that 
the Lagrange functions linear in a curvature of the world line prove to be 
distinguished in this field too. The richest set of appealing properties is 
provided by the Plyushchay model jSH E2], where a = 0. In this case the 
action is scale invariant. A complete set of constraints in the phase space 
of this model possesses many symmetries, specifically, local W3 symmetry. 
This model describes the massless particles, the helicity of which acquires, 
upon quantization, integer and half-odd-integer values. It is interesting that 
classical solutions in this model are the spacelike helical curves |31j . 

5 Conclusion 

Proceeding from rather general principles and making use of the basic facts 
concerning the conformation of globular proteins we have obtained, in a 
unique way, a geometrical model for phenomenological description of the 
free energy of helical proteins. It is worth noting that our functional (4.6) 
should be considered as an effective free energy of the helical protein which 
already takes into account the n atomic interactions within the protein and 
with the solvent. Hence, there is no need to quantize it, as one proceeds in 
the random walk studies |3Uj . 

Certainly our simple model does not pretend to describe all the aspects 
of the protein folding. However, one can hope that it could be employed, for 
example, in Monte Carlo simulation to search for a stable native state of the 
protein. In this case the model can be used for the description of the free 
energy of individual parts (blocks) of a protein chain that have the helical 
form. Without any doubt, it should result in simplification and acceleration 
of the exhausting searching of the native stable state of the protein chain by 
a computer pQ. 

In the general problem of the protein folding, a self containing task is to 
reveal the mechanism of the protein chain transition into the stable native 
state. A typical time of this process is such that it is completely insufficient 
to show the mutual influence of all the parts of the protein chain during the 
folding to the stable state. Presumably the functional proposed in this paper 
can be useful here too. For this purpose the coordinates of the protein chain x 
should be considered as time dependent, i.e., x = x(t, s). During its motion, 
the protein chain sweeps out a two dimensional surface in ambient space 
described parametrically by its coordinates x(t, s). In this case the analogue 
of the line curvature k(s) is the local geometrical invariant of a surface, its 
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external or mean curvature H(t, s) jllj . A straightforward generalization of 
the functional (2.4) to the dynamical problem at hand is the action S linear 
in external curvature of the surface 



where da is a differential element of the surface swept out by a protein chain 
in the course of its motion, a and b are constants. 

In the elementary particle theory the models like (5.1) are known as the 
rigid relativistic string [231 133] • One of the motivations to consider such 
strings was the attempt to develop string description for quantum chromo- 
dynamics. A peculiarity of the classical dynamics of such string models is 
its instability [31]. The reason of this is the dependence of the action (5.1) 
not only on the velocity dx(t, s)/dt but also on the acceleration <i 2 x(t, s)/dt 2 
of the protein chain. As a result, the energy of such systems proves to be 
unbounded from below [35J. It is very likely that this instability could be cru- 
cial in describing, in the framework of the string model (5.1), the transition 
of the protein chain to the stable native state. 

Acknowledgements 

This study has been conducted during the stay of one of the authors (V.V.N.) 
at Salerno University. He would like to thank G. Scarpetta, G. Lambiase, 
and A. Feoli for the hospitality extended to him. One of the authors (A.F.) 
wishes to thank A. Rampone for useful discussions and references about the 
protein folding problem. The financial support of INFN is acknowledged. 
V.V.N, was partially supported by the Russian Foundation for Basic Research 
(Grant No. 03-01-00025) and by the International Science and Technology 
Center (Project No. 840). 

References 

[1] H.S. Chan and K.A. Dill, Physics Today, 46, No. 2 (1993) 24. 
[2] K.A. Dill, Protein Science, 8 (1999) 1166. 

[3] F.R. Banavar, A. Maritan, C. Micheletti, and F. Seno, Geometrical 
aspects of protein folding, |cond-m at / 0105209 , 

[4] P. Ramon, Field Theory. A Modern Primer, The Benjamin/Cummings 
Publishing Company Inc., London, 1981. 




(5.1) 



16 



[5] O. Kratky, G. Porod, Reel. Trav. Chim. 68 (1949) 237. 

[6] A. Kholodenko, Phys. Lett. A 141 (1989) 351; Ann. Phys. (N. Y.) 202 
(1990) 186. 

[7] A. Kholodenko, M. Ballauff, M. Aguero Granados, Physica A, 260 (1998) 
267. 

[8] J.R. Banavar, A. Maritan, Rev. Mod. Phys. 75 (2003) 23. 

[9] R.D. Kamien, Rev. Mod. Phys. 74 (2202) 953. 

[10] S. Hyde, S. Anderson, K. Larsson, The Language of Shape, Elsevier, 
Amsterdam, 1997. 

[11] L.P. Eisenhart, Riemannian Geometry, Princeton University Press, 
Princeton, 1964; 

M.P. Do Carmo, Differential Geometry of Curves and Surfaces, Prentice- 
Hall, London, 1976; 

M. Spivak, A Comprehensive Introduction to Differential Geometry, 
Publish or Perish Inc., Houston, 1979. 

[12] R. Salem and A. Zygmund, Duke Math. J. 12 (1945) 559. 

[13] V.V. Nesterenko, A. Feoli and G. Scarpetta, J. Math. Phys. 36 (1995) 
5552. 

[14] V.V. Nesterenko, A. Feoli and G. Scarpetta, Class. Quantum Grav. 13 
(1996) 1201. 

[15] B. O'Neill, Semi - Riemannian Geometry, Academic, New York, Lon- 
don, 1983; 

K. L. Duggal and A. Bejancu, Lightlike Submanifolds of Semi - Rie- 
mannian Manifolds and Applications, vol. 364 of Mathematics and its 
Applications, Kluwer, Dordrecht, The Netherlands, 1996. 

[16] M.M. Postnikov, Lectures on Geometry. Semester III: Smooth Mani- 
folds, Nauka, Moscow, 1987. 

[17] Yu.A. Aminov, Differential Geometry and Topology of Curves, Nauka, 
Moscow, 1987. 

[18] P.A. Griffiths, Exterior differential systems and the calculus of varia- 
tions, Birkhauser, Boston, 1983). 



17 



[19] G.A. Korn and T.M. Korn, Mathematical Handbook, McGraw-Hill, New 
York, 1968. 

[20] V.V. Nesterenko, J. Phys. A 22 (1989) 1673. 

[21] S.S. Schweber, An Introduction to Relativistic Quantum Field Theory, 
Row Peterson, New York, 1961. 

[22] B.M. Barbashov and V.V. Nesterenko, Fortschritte der Physik, 31 (1983) 
535. 

[23] B.M. Barbashov and V.V. Nesterenko, Introduction to the Relativistic 
String Theory, World Scientific, Singapore, 1990. 

[24] A.B. Harris, R.D. Kamien, and T.C. Lubensky, Rev. Mod. Phys. 71 
(1999) 1745. 

[25] V. V. Nesterenko, J. Math. Phys. 32, 3315 (1991). 

[26] V. V. Nesterenko, J. Math. Phys. 34, 5589 (1993). 

[27] A.M. Polyakov: Nucl. Phys. B 286 (1986) 406; 
H. Kleinert, Phys. Lett. B 174 (1986) 335; 
V.V. Nesterenko, J. Math. Phys. 32 (1992) 3315; 
M.S. Plyushchay, Phys. Lett. B 253 (1991) 50; 

H. Arodz, A. Sitarz, and P. Wegrzyn, Acta Phys. Polonica B 20 (1989) 
921; 

M. Pavsic, Phys. Lett. B 205 (1988) 231; 

J. Grundberg, J. Isberg, U. Lindstrom and H. Nordstrom, Phys. Lett. 
B 231 (1989) 61; 

J. Isberg, U. Lindstrom and H. Nordstrom, Mod. Phys. Lett. A 5 (1990) 
2491. 

[28] A.M. Polyakov, Mod. Phys. Lett. 3 A (1988)325; 

V.V. Nesterenko, Class. Quantum Grav. 9 (1992) 1101; 

M.S. Plyushchay, Nucl. Phys. B 362 (1991) 54; 

S. Iso, C. Itoi, and H. Mukaida, Nucl. Phys. B 346 (1990) 293. 

[29] V.V. Nesterenko, J. Math. Phys. 34 (1993) 5589. 

[30] R.D. Pisarski, Phys. Rev. D 34 (1986) 670; 

J. Ambjorn, B. Durhuus, and T. Jonsson, J. Phys. A 21 (1988) 981; 
A.L. Kholodenko, Ann. Phys. 202 (1990) 186. 



18 



[31] M.S. Plyushchay, Mod. Phys. Lett. A 4 (1988) 837; 
M.S. Plyushchay Int. J. Mod. Phys. A 4 (1989) 3851; 
M.S. Plyushchay Phys. Lett. B 243 (1990) 383. 

[32] C. Batlle, J. Gomis, J.M. Pons, and N. Roman-Roy J. Phys. A: Math. 
Gen. 21 (1989) 2693. 

[33] A.L. Kholodenko and V.V. Nesterenko, J. Geometry and Physics 16 
(1995) 15. 

[34] T.L. Curtright, G.I. Ghandour, C.B. Torn, and C.K. Zachos, Phys. Rev. 
Lett. 57 (1986) 799; 

T.L. Curtright, G.I. Ghandour, and C.K. Zachos, Phys. Rev. D 57 
(1986); 

E. Braaten and C.K. Zachos, Phys. Rev. D 35 (1987) 1512; 

F. David and E. Guitter, Europhys. Lett. 3 (1987) 1169. 

[35] A.M. Chervyakov and V.V. Nesterenko, Phys. Rev. D 48 (1993) 5811. 



19 



